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The Colletotrichum plant-like subtili- 
sins (CPLSs) are a family of proteins 
found only in species of the phytopatho- 
genic fungus Colletotrichum. CPLSs 
have high similarity to plant subtilisins 
and our previous work has shown that 
they were acquired by an ancient hori- 
zontal gene transfer event from plants. 
The rapid growth of sequence data in 
public databases enabled us to reexam- 
ine the structure and evolution of the 
CPLSs. A new plant subtilisin structural 
model aided us in refining the tertiary 
structure of CPLSs. Also, new informa- 
tion about protein interactions of plant 
subtilisin has provided new insights into 
the putative function of CPLSs. The 
availability of new genome sequences 
of members of the genus Colletotrichum 
gave us the opportunity to further vali- 
date our hypothesis that the CPLSs are 
unique to the Colletotrichum lineage. 
Together, this information furthers our 
knowledge of the potential role of the 
CPLSs in pathogenicity and the role of 
HGT in the genome evolution of plant 
pathogenic fungi. 

The increasing availability of genomic 
sequences that are available in public data- 
bases has enabled researches to detect rare 
evolutionary events such as Horizontal 
Gene Transfer (HGT). 1 " 10 Today it is 
widely accepted that HGT is not an 
unusual phenomenon. There are many 
reports of HGT between bacteria and 
from bacteria to eukaryotes. However, less 
common are the reports of HGT events 
from plants to fungi. 11 " 14 Our recent report 
of the horizontal transfer of a subtilisin 
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(CPLS; Colletotrichum plant-like sub- 
tilisin) from plants to an ancestor of the 
fungal genus Colletotrichum is particularly 
interesting because it shows the horizontal 
transfer of a subtilisin S8 (family of prote- 
ases involved in many different pathways) 
from plants to a phytopathogen. 15 

A New Structural Model for CPLS 

We previously reported (see ref. 15) a 
prediction of the tertiary protein struc- 
ture of a C. graminicola CPLS (locus ID 
GLRG_05578) using crystallized protein 
structures of subtilisins with sequence 
similarity to CPLSs. The most similar 
one was the SBT3 of Solarium lycopersi- 
cum (PDB ID 3I6S). Recently, the crys- 
tal structure of the subtilisin Cucumisin 
of Cucumis melo was published. 16 With 
this new structure available, we recalcu- 
lated the 3-D model of GLRG_05578 
(the CPLS of Colletotrichum graminicola) 
using the Phyre 2 server. 17 The new struc- 
tural model has only 57 residues predicted 
ab initio (highly unreliable sites) instead of 
73 as reported in Armijos Jaramillo et al. 15 
The new GLRG_05578 structural model 
was aligned to the structural models of 
SBT3 and Cucumisin to determine the 
similarities and the differences with the 
previous model. The structural alignment 
of the previous model of GLRG_05578 
and SBT3 has a maximum RMSD (root— 
mean-square-deviation) of 11.68 and a 
minimum of 0.02. The new model has 
a highest RMSD value of 12.65 and the 
lowest value of 0.03. These results show 
that the new model is less similar to 
SBT3 than the one reported previously. 15 
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Figure 1. Structural alignment of the new model predicted for the tertiary conformation of 
CPLS GLRG_05578 of Colletotrichum graminicola and one monomer of the crystallized subtilisin 
Cucumisin (3VTA) of Cucumis melo. The color spectrum represents the values of pairwise RMSD, 
with blue specifying minimum values and red indicating maximum values. Gray sections are not 
aligned. Labels indicate the relevant domains and sites of the proteins conserved in subtilisin 
SBT3 of Tomato in Ottman et al. 18 



However, the new model is more similar to 
the SBT3 subtilisin than to the Cucumisin 
(Fig. 1) (maximum RMSD of 17.59 and 
minimum RMSD of 3.09 with the old 
model and maximum RMSD of 15.12 
and minimum RMSD of 1.84 with the 
new model). The new model reveals dif- 
ferences in the tertiary protein structures 
of the PA domain, (3-hairpin arrangement 
and one of the putative Ca +2 replacement 
sites between CPLS and tomato subtilisin. 
These sites are important in dimerization 
and stabilization of tomato SBT3 18 and the 
differences with CPLS could imply differ- 
ences in the quaternary structure (confor- 
mation of dimers) and/or the specificity of 
interaction with other proteins. 

Interactions of Plant Subtilisins 
could Reveal the Function of 
CPLSs 

Although the protein targets of the CPLSs 
are unknown, analysis of the most simi- 
lar plant proteins could reveal potential 



clues. A BLASTP search in the UniProt/ 
SwissProt database shows that the most 
similar proteins to CPLSs are 3 subtilisins 
of Arabidopsis thaliana (065351, 064495, 
Q9LLL8) and the Cucumisin of Cucumis 
melo (Q39547). Information about the 
interaction of the Arabidopsis subtilisins 
is available in the STRING database," 
but it is not available for Cucumisin. 
This database shows that 064495 is co- 
expressed with several proteins involved 
in the development of stomata, including 
AT1G63700,AT5G62230,AT1G80080, 20 
AT5G53210, AT3G06120, 21 and others. 
The protein 065351 (ARA12) is pre- 
dicted to interact with AT1G22300 and 
AT4G09000, 2 general regulatory fac- 
tors. 22 Gene Q9LLL8 is co-expressed with 
AT4G35350 and AT1G20850, 23 which 
have peptidase activity and cysteine-type 
peptidase activity, respectively. The inter- 
action and co-expression of these proteins 
could help to explain the role of CPLSs in 
the host and help to plan future studies. 
However, as we commented in Armijos 



Jaramillo et al., 15 the subtilisins in plants 
have a variety of metabolic functions. 
Plant subtilisins with high percentage of 
sequence similarity to CPLSs are involved 
in varied, and sometimes unrelated bio- 
logical process. The CPLS homologs in 
Arabidopsis are a good example of this 
phenomenon. Protein 065351 (ARA12) 
is essential for mucilage release from seed 
coats, whereas 064495 is involved in sto- 
matal development and distribution, and 
Q9LLL8 is involved in negative regulation 
of catalytic activity. The functions of the 
CPLSs are not clear yet, but they could be 
involved in the mis-regulation or mimicry 
of any of those processes. Alternatively, 
novel functions might also be expected for 
CPLS. 

CPLS are Unique to 
Colletotrichum 

Our hypothesis is that the CPLS HGT 
event took place in an ancestor of the 
genus Colletotrichum. Therefore, CPLSs 
should be present in all of the extant spe- 
cies of Colletotrichum and absent in species 
outside the genus. We previously showed 
that the CPLSs are present in the genome 
sequences of all 3 species of Colletotrichum 
that were available at that time but absent 
in all other fungi. Recently, Gan et al., 13 
reported the presence of subtilisins, likely 
resulting from an HGT event, in the 
genomes of C. orbiculare (Cob 12233, Cob 
06327) and C. gloeosporioides (CGGC5 
2662), adding further support to our 
hypothesis. In addition, genome sequences 
for several more species of Colletotrichum 
have recently become available, enabling 
us to further validate our hypothesis. A 
BLAST search of the GenBank nr data- 
base 24 revealed the presence of CPLS 
homologs in the genome of C. gloeospori- 
oides Nara gc5 (GenBank accession num- 
bers: ELA37268 and ELA38265). We 
also performed BLAST searches against 
the proteomes of 2 additional species of 
Colletotrichum (C. gloeosporioides 23 and 
C. fiorinae MH 18, http://www.jgi.doe. 
gov) and found that both species have 
CPLS homologs. A phylogenetic analy- 
sis (data not shown) shows that all of the 
putative CPLS homologs share common 
ancestry with the CPLSs identified in our 
previous report. This idea is consistent 
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with our hypothesis of a plant protein 
that was transferred to an ancestor of the 
genus Colletotrichum. HGT events among 
eukaryotes are presumed to be rare, and 
we assume that those that we observe in 
populations, such as the CPLSs, have been 
fixed due to strong selective pressure. In 
the case of plant pathogenic fungi, such 
as Colletotrichum, horizontally transferred 
genes may have some role in pathogenic- 
ity, enabling the fungus to adapt to new 
hosts or enhancing its ability to colonize 
its host. Research is already underway 
which is aimed at determining whether 
CPLSs have a role in pathogenicity in 
Colletotrichum. 
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